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Abstract. We give an informal introduction to the most basic techniques used to evaluate 
moments on the critical line of the Riemann zeta-function and to find asymptotics for sums 
of arithmetic functions. 



1. Introduction 

The simplest way to compute a moment of the zeta-function is to approximate the zeta- 
function by Dirichlet polynomials and then compute the moment of the polynomials. In this 
paper we describe the most rudimentary techniques in this area. Along the way discuss the 
basic methods for finding asymptotics of sums of arithmetic functions, and we also compute 
the arithmetic factor in the standard conjectures for moments of the zeta-function. The 
methods described here are completely standard: our intention is to give a brief introduction 
to those who are new to the subject. The standard reference is Titchmarsh jTj and we cite 
the specific sections where one can look for more details. 

We assume a knowledge of the calculus of one complex variable. Readers unfamiliar with 
the big-0, little-o, and <C notation (and physicists, who may use the -C notation differently), 
should consult the Appendix. 

2. The "first moment" 
The simplest approximation to the zeta-function is 

(2.1) as)= -,-f^+o{T-n, 

^ — ^ n'^ I — s 

l<n<T 

where s = a + it and the equation is valid for \t\ < T. See Section 4.11. Specializing to 
s = I + it, we have 

l<n<T \ 



Now we can find the average value of the zeta-function on the critical line. The justification 
of the steps follows the calculation. 



Jo l<n<T ^ 



1 + ltl 



■dt 



T+Y: -^'-^^ + O (ThogT 



„ln log n 

2<n<T 
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T + T^logT . 



(2.3) 

In the first step we switched the sum and the integral, which is justified because both are 
finite. The sum on the second fine was estimated by 



(2.4) 



E 



1 l-n 
^/n logn 



-iT 



•C 



1 



•C 



1 



y^logn J2 ^/x\ogX 



-dx <C 



logT' 



2<n<T * ° 2<n<T 

which is smaller than the other error term. 

Thus, the zcta-function on the critical line is 1 on average. This is not a particularly useful 
piece of information, for it is the magnitude of the zeta-function which is of interest. So we 
consider the moments of |^(| + it)|. 



3. The 2nd moment 

The simple fact that = zz means that |C(| + ^^)|^ is much more amenable to methods 
of complex analysis when K — 2k is an even integer. The easiest case is the 2nd moment, 
which we compute in this section. 

For later use it will be helpful to first consider the 2nd moment of a general Dirichlet 
polynomial. Suppose 

P^s)= y ^. 



(3.1) 
We have 



l<n<N 



\P{U)f dt 



E 



l<n<Ar 

Or, 



n- 



dt 



-dt 



-it 



= E 



m 
T 



in) * 



(3.2) 



n n^m 



ano^ [{^f - 1) 



log(m/n) 



TM{N)+8{N, T), 



say. We think of T Ai{N) as the main term and £{N, T) as an error term, so we want to 
understand when E{N,T) will be smaller in magnitude than T M.{N). 
Setting m — n-\-hwe can rewrite the error term as 



(3.3) 



anttn+h ({l + ^y^ -1 



log(l + h/n) 

Now consider only the h — 1 term from the above sum: 

(3.4) Qnttn+i ■ n ■ (something bounded). 



l<n<Ar 
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Without any information on a„, the above sum, which is just one part of S{N,T), could be 
about the same size as NA4{N). Thus, in general one should only expect S{N,T) to be 
smaller than T A4{N) if N < T. And if > T then one may need some detailed information 
about the coefficients a„ in order to extract something meaningful from S{N,T). Goldston 
and Gonek ^GGj have given a clear discussion of these issues. 

One can carry through the above calculation to obtain a useful general mean value theorem 
for Dirichlet polynomials. See Titchmarsh [Tj, Section 7.20. However, a stronger result is 
provided by the mean value theorem of Montgomery and Vaughan |MVj : 

2 

lit 



(3.5) 



E 



l<n<Af 



dt= K\HT + 0{n)) 



l<n<N 



This result is best possible for general sequences a„. Note that if = o(T) then the error 
term is smaller than the main term. 

To use the mean value theorem to compute the second moment of the zeta-function, first 
write (Q asC = S + E. That is, 

(3.6) ^=T.-L E- 

l<n<T 

Using 

(3.7) ICI^ = \S + E\^ = {S + E)(S + E) = \S\^ + 2^SE+ \E 




we have 



(3.8) / \Q{\ + it)\^ dt= / \S\^dt + 2'R / SEdt+ / \E\^dt. 
Jo Jo Jo Jo 

We want to evaluate the \S\'^ integral as our main term and estimate the \E\'^ integral as our 
error term, but what to do about the cross term? By the Cauchy-Schwartz inequality, 

(3.9) 23? SEdt<^J^ \S\\E\ dt <^ (^J^ \S\^d?j (^j^ \E\^dt^ . 



If \E\^dt is smaller than \S\^dt, then so is the right side of the inequality above. We 
have the general principle that if the "main error term" is smaller than the main term, then 
so are the cross terms. It remains only to evaluate \ S\^dt and estimate \E\^dt. 
For the main term use ()3.5p with a„ = n~^: 

r \S\^dt = Yl -iT + 0{n)) 

^ 1 ^„ ^rri ^ 



l<ra<T 



T(logT + 0(l))+ Yl 



l<n<T 

(3.10) = TlogT + 0(T). 



For the error term we have 



(3.11) / \E\^dt<^T I j^—-—dt<^T, 



^ 1 
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which is smaller than the main term. We have just proven the mean value result 

(3.12) [ \Cil + zt)\^dt = T\ogT + 0{T\og^T). 

Jo 

Note that we do not obtain an error term of 0{T) in ()3.12j) by these methods. However, 
a much better error term can be obtained. The first step toward this is given in the next 
section. 

Exercise. Approximate the sum by an integral to show that if A > 1 then 

^-^ A — 1 

n>T 

Conclude that if A > 1 then 

(3.14) Y.\ = C(^) - ?4 + 

n<T 

Exercise. Show that if ^ < a < 1 then 

(3.15) \(;^cr + ^t)\'dt = T|^a2a)-^^j+oi^log--T 

where the implied constant in the big-0 term is independent of a. 

Since the error term in ()3.15|) is uniform in sigma, we can let a ^ to recover ()3.12|) . 
This makes use of the fact that ({s) has a simple pole with residue 1 at s = 1. 

Note that if a > | is independent of T then the right side of ()3.15j) is of size ~ T. On 
the other hand, the second moment on the |-line is of size T log T. Thus there is an abrupt 
change in the behavior of the zeta-function when one moves onto the critical line. Equation 
()3.15|) illustrates that the transition occurs on the scale of 1/ log T from the i-line. 



4. Better 2nd moment 

The methods of the previous section are not sufficient to evaluate the main term of the 
2nd moment with an error term O(T^) for A < 1, nor are those methods sufficient to 
evaluate the 4th moment of the zeta-function. Evaluating the 4th moment by squaring ()2.2|) 
gives a Dirichlet polynomial of length T^, which cannot be handled by the Montgomery- 
Vaughan mean value theorem. So one needs either a shorter approximation to C{s), or an 
approximation to C'^{s) of length < T, or a way to handle longer polynomials. In preparation 
for the 4th moment, we first evaluate the 2nd moment with a better error term. 

The "approximate functional equation" of Hardy and Littlewood expresses the ^-function 
as a sum of two short Dirichlet polynomials: 

(4-1) C(^) =J:^s+ Xis) J:J^s+0 (a:-- + Itl^V"^) , 

l<n<x ^l£n<y 

where xy = t/27r and x{s) is the usual factor in the functional equation ({s) = x(s)C(l — s). 
The name "approximate functional equation" comes from the fact that the right side looks 
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like C{s) when x is large and like x{s)({l — s) when y is large. On the |-line we have 



l<n<Af"'^ l<n<Af"'^ 



where we set x = y = = \ftj^. 

Hardy and Littlewood used the approximate functional equation to evaluate the second 
moment of the zeta-function with a better error term. We will not carry out the calculation, 
but just give the flavor. See Titchmarsh [Tj Section 7.4 for details. Writing ()4.2|) as C = 
S + + -E, we have 

(4.3) I \C,{\^it)fdt = 2 [ |5pdt + 23fJ [ xi^ - ^t) dt + eiioi term. 
Jo Jo Jo 

In that calculation we used the fact that + it)\ = 1. Below we will use the more precise 
information 

(4.4) x{s)=(^y ^^*+-^/^(i + 0(r^)). 



The main difficulty is evaluating the xik ~ it)S'^ term, which equals 



(4.5) ^7= / Xil-^t)inm^'dt. 

n.m ^ 



By fj4.4|) . up to a negligible error that integral is of the form J e^^^^^dt. If /(t) has a stationary 
point in the range of integration then we can extract a main term, otherwise it will become 
an error term. In particular, that integral can be handled by the method of stationary 
phase. See Titchmarsh PP Section 7.4 for details. Our point here is that by virtue of ()4.4|) . 
integrals involving x{\ + and Dirichlet polynomials can be handled. The result that can 
be obtained by the above argument is 



(4.6) 



/ IC(i + ^t)?dt = T logT + (27 - 1)T + 0(T5+^). 
Jo 



The error term can be improved by more sophisticated methods. 

There are many applications of mean values of the zeta-function multiplied by a Dirichlet 
polynomial. Suppose 

(4.7) M{s)= y: I 

l<n<T» 

with bn <^ rf . By the approximate functional equation, C(s)M(s) can be approximated by 
Dirichlet polynomials of length T2+^. So by the above methods, if 6' < | then one should 
be able to find an asymptotic formula for |C^(| + it)\^dt. Evaluating such an integral, 
with ^ = i — £, was key to Levinson's proof that more than one-third of the zeros of the 
zeta-function are on the critical line. Conrey made use of very deep and technical results to 
evaluate such an integral with ^ = | — e, leading to the result that more than two-fifths of 
the zeros are on the critical line. 
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5. The 4th moment 

To evaluate the 4th moment of the zeta-function by the methods described above, one 
requires an approximation to ({^Y of length less than T. This is provided by the following 
approximate functional equation: 

(5.1) CW== E "^^Xisf E §^ + 0(.i-log«), 

where xy = (t/27r)^ and d{n) is the number of divisors of n. More generally one should 
expect an approximate functional equation of the form 

(5.2) C(^)^= E ^ + X{sr E error term 

l<n<x 

where xy ~ t'^ and dk{n) is the fc-fold divisor function 

(5.3) dkin)= E 1' 

which has generating function 

(5.4) c(,r=±^, a>i. 

n=l 

Plugging ()5.H) into the Montgomery- Vaughan mean value theorem leads to 

(5.5) / |C(H^^)I^^^ = ^^log^^+ error term. 
Jo 27r^ 

If you actually do the calculation, you will find that in order to determine the main term 
you need to evaluate sums like 



(5.6) E 



d{n) 



n 

l<n<X 



There is a standard technique for finding the leading-order asymptotics of such sums, which 
is given in the next section. 

5.1. Comments on approximate functional equations. The error term in (|5.1|) is rather 
large and leads to an error term of size 0{T log^ T) in the 4th moment |C(| + it)\'^dt. The 
large size of the error term is due to the fact that our sums have a sharp cut-off. The error 
term can be much reduced by having a smooth weight in the sums. That is. 



(5.7) C{sf = E t) + x{s)'' E t) + small error term. 



n 



where ip and ip* are particular functions that are approximately 1 for n < t2 and decay for 

k 

n > t2 . As our previous discussion should suggest, it is the length of the sums, and not the 
size of the error term, which provides the true difficulty when k > 2. 
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6. Perron's formula 

Here is the problem: you have an arithmetical function a„ and you want to find the 
asymptotics of 



(6.1) S{X)= 



^71 • 

l<n<X 



This problem can often be solved by the most basic methods of analytic number theory. 
First note that for integers > 1, 

1 ^+*^ ds f + error term if A > 1 



(6.2) — — / A* — = < (^-1)! 

27ri J^_iY |o + error term if A < 1 

To see this, consider the integral 

1 f ds 
6.3 

where the integration is over the closed rectangular path connecting the points 



(6.4) Ci 



[l-iY,l + iY,~B + iY,~B -iY], ii A > 1 
[l-iY,l + iY,B + iY,B -iY], ifA<l, 



where is a large positive number. In both cases the main term comes from the residue of 
the pole at 0, which is or is not inside the path of integration. 

Exercise. Bound the error term in ()6.2j) by estimating the integral along the three segments 
of Ci other than [1 — iY, 1 + iY]. You should find that if > 2, then you can let F ^ oo 
and the error vanishes. See Section 3.12 of fT\ if you aren't sure how to begin. 

To evaluate (|6.ip . let 
(6.5) F{s) = J2 



oo 



71=1 



and suppose that the sum converges absolutely for a > ctq. Using ()6.2|) and supposing a > ctq, 
we have 

1 r+''' ^ds A 1 r+'^^rxYds 



oo 



1 + error term if X > n 
+ error term if X < n 
(6.6) = S{X) + error term. 



oo 
n=l 



This is known as Perron's formula. If we can learn enough about the function F[s) so that 
the integral in ()6.6p can be evaluated in another way, then we will have a formula for 5'(X). 

Suppose (as is frequently the case) that F{s) has a pole at ctq and no other poles in the 
half-plane a > o"i for some ai < ctq. Then consider 

JC2(e) ^ 
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where for e > the integration is over the rectangular path with vertices 
(6.8) C2{e) = [a-iY,a + iY, + e + iY,ai + e - iY]. 

We can evaluate ()6.7|) by finding the residue at the pole s = ctq and at s = (if is inside 
the path of integration. And in the same way as you estimated the error term in ()6.2j) . we 
find that ()6.7p equals the integral in Perron's formula plus an error term. The final step of 
bounding the integral on the 3 other segments requires the additional ingredient of a bound 
for F{(7 + it) as t —>■ oo, uniformly for a > ai. 

For example, at the end of the previous section we wanted to evaluate the sum of d{n)'^/n. 

Exercise. Check that 



(6.9) Yl 



C(2s)' 

Hint: both sides have an Euler product. The factors on the right can be found from the 
Euler product for the zeta-function. Those on the left require summing Yl'jLod{p')'^P ''^- 

Thus, we apply Perron's formula ()6.6j) with 

C\s + 1) 



(6.10) F{s) 



C(2s + 2)- 



To determine the analytic properties of F{s), use the fact that ({s) is entire except for a 
simple pole at s = 1, where we have the Laurent expansion 

(6.11) ^(s) = ^ + ^ + ... . 

s — I 

Also C(s) has no zeros in a > 1, and no zeros in a > | assuming the Riemann Hypothesis. 
In these calculations one frequently needs that C(2) = 7r^/6 and 

(6.12) ^(5) = _i_ilog(27r)s + --- 
for s near 0. 

To estimate the error terms, one can use the convexity estimate 

1 (T > 1 

(6.13) + it) < <i |t|^-^^ < a < 1 

Itl^-" a<0, 

along with ({s) ^ 1 for o" > 1. Also, assuming RH we have ^ ({a + it) <^ t^ for cr > \. 
All of these estimates are for fixed cr as t oo. 
Assembling the pieces we find 



(6.14) 



d{n)^ log^X 



l<n<X 



Exercise. Argue that ()6.14|) is of the form XP4(logX) + 0{X^) where P4 is a polynomial 
of degree 4 and B < 1. Find the next-to- leading coefficient of P4, and estimate B both with 
and without assuming the Riemann Hypothesis. 
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Exercise. Deduce the following asymptotics: 

^ 4(n) ~ l^Xlog'X 



k\ 

l<n<X 

(6.15) - 



TT 

l<n<X 



2 

2"" ' 



where (p{n) is the Euler totient function. In addition to the generating function ()5.4j) . you 
should use (and prove): 

^(n) as -I) 



(6.16) 5^ 



as) ■ 



Exercise. Find the next-to-leading order terms in the previous exercise. Also determine the 
shape of the main terms and estimate the size of the error terms, both with and without the 
Riemann Hypothesis. 

Note that there are interesting and important cases where the above analysis is inadequate. 
For example, in the proof of the prime number theorem a„ = A(r;,), the von Mangoldt 
function. Then F[s) has a pole at s = 1 as well as poles at the zeros of the (^-function and 
one must use a more complicated path of integration as well as nontrivial estimates for (^{s) 
in the critical strip. 

7. The conjecture for moments of the zeta-function 

Much recent work on the relationship between L-functions and Random Matrix Theory 
was motivated by the problem of finding conjectures for the 2A;th moment of the Riemann 
zeta-function on the critical line. Conrey and Ghosh |CGj formulated it as follows: for each 
integer /c > there exists an integer such that 

(7.1) f^m + tt)rdt^g,^^T\og''T, 

where 



m 



In this conjecture the only missing ingredient is the integer g^. Keating and Snaith [Kl 
computed the moments of characteristic polynomials of unitary matrices and used the result 
to conjecture 

k-l .| 

(7.3) 9k = k'\X{jr^,- 

It is not trivial to show that this g^ is actually an integer |CFj . 

Our last topic in this paper is to show how the factor arises naturally. From the 
approximate functional equation ()5.2j) it is reasonable to consider 

2 

dk{n) 



(7.4) 



E 



n<T 



dt. 
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Note that the sum has length T. This is good because we can use the mean value theorem. 
But it is bad because the polynomial is not long enough to fully approximate ^{^ + it)''. 
We cannot expect this mean value to equal the 2/cth moment of the zeta-function, but how 
far will it be off? It would be nice if it were off by some simple factor, so one possible 
interpretation of gk is "the number of length T polynomials needed to capture the 2/cth 
moment of the ^-f unction." We do not claim that this was the original reasoning of Conrey 
and Ghosh. 

By the Montgomery- Vaughan mean value theorem the above integral has main term 



(7.5) E 



n 

n<T 



By Perron's formula, to evaluate this we need to find the leading pole of 

n<T 

li k > 2 then F{s) is not a simple expression involving known functions, but fortunately we 
do not require complete information about F{s). 
First note that 



(7.7) = + 



s-1 



so 



<w" = n(i+^+-) 



Thus, if the coefficients of p^'* in an Eulcr product are integers that only depend on j, then 
the coefficient of tells you the order of the pole at s = 1. Since 

dk{p) = E 1 

ni--n/c=P 

(7.9) = k, 

we see that F{s) has a pole of order /c^ at s = 1, that is, F{s) = ak{s — 1)~''^ -\ — ■ , where we 
will show that ak is as given above. To see that F{s) has no other poles in cr > |, note that 

(7.10) c-'(,)f(,) = n(i+A + A+...), 

where the f3j are certain integers that do not grow too fast. In particular, the above Euler 
product converges absolutely for cr > | so it represents a regular function that is bounded 
in (7 > i + £. We have all of the pieces to apply the methods of the previous section, giving 



(7.11) 



2 
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where 

ak = lim(s - 1)^''f(s) 
= limC(s)-'=>(.) 

S—i-l 

(7.12) = n(i--) E^'^^p"^)'^""'- 

p ^ ^ ' m=0 

Note that the product converges because dk{p) = k and dk{n) -C n"^. 

Finally, dkijf^) = C^^™"^)) as can be seen by the following argument. Since dkijP) is the 
number of ways of writing ei + ■ ■ ■ + = m, we can select the Cj by writing down m + k — 1 
circles o and filling in — 1 of them to make a dot •. Then the ej are the number of circles 
between the dots, including the circles before the first and after the last dot. For example, 
here is one configuration that arises from d^lp^): 

• o o • o • • 

(7.13) = 1 p 1 1 



8. The Estermann phenomenon 

The idea behind formula ()7.10|) can be generalized to show that if c{n) is a multiplicative 
function such that c{n) <^ and c{p^) is an integer that is independent of the prime p, then 

ns> - El? 

n=l 

(8.1) = Zjis)l[ajsf^'\ 

j<J 

where the C{j) are integers and Z{s) is regular and bounded in a > 1/J. Thus -F(s), which 
is originally defined for a > 1, has a meromorphic continuation to a > 0. 

Note that F{s) cannot be continued past a = unless C{j) = for almost all j. This is 
because the zeros of the zeta-function lead to zeros or poles of F{s) that accumulate along 
the (7 = line, giving a natural boundary. This is known as "the Estermann phenomenon" . 



9. Appendix: big-0 and < notation 

The statement 

(9.1) /(x) = 0{g{x)) as x ^ oo 
is pronounced "/(x) is big oh of g{x)." It is equivalent to 

(9.2) /(x) ^ g{x) as x ^ oo, 

which is pronounced "/(x) is less than less than g{x)." The symbol <^ is typed as \11 in 
T[HjX. Both of the above statements mean the following: there exists a constant C such that 
if X is sufficiently large then \ f{x) \ < C g{x). The number C is called "the implied constant." 
Note: 

• f{x) <^ g{x) does not mean that f{x) is much smaller than g{x). It is more accurate 
to say that f{x) does not grow faster than g{x). 
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• the above statements have the condition "as x — >• oo" . It is also common to use the 
big-O and <S notation to describe the behavior of a function as x — > 0. Then the 
definition is modified to "if x is sufficiently small" . Usually context makes it clear 
which behavior is being considered. 

• Both notations are useful: the -C does not require parentheses, and the big-0 can be 
used as one term in a formula. 

Here are some examples. Below, A and e are arbitrary fixed positive numbers. 
Examples assuming a; — > oo: 

x^ < 

log(,T) = 0{x) 

log(x) <C x'^ 

x^ = 0(e") 
sin(x) -C 1 

(a; + 2)i° = x^^ + Oix""). 



Examples assuming a; — > 0: 



< x^ 



log(l + x) = 0{x) 

log(l + a;) = x + 0{x'^) 

sin(a;) <^ x 

{x + 2y° = 1024 + 0(x). 



9.1. Little-o notation. The statement 

(9.5) f{x) = o{g{x)) as a; — > oo 
is pronounced "/(a;) is little oh of g{x)." It means 

(9.6) lim ^ = 0. 

x^oo g[x) 

Equivalently, for all C > 0, if a; is sufficiently large then |/(a;)| < Cg{x). It is like big-0 
where the implied constant can be made arbitrarily small. 

Note that f{x) ~ g{x), ''''/{x) is asymptotic to g{x)" is equivalent to f{x) — (l + o(l))g'(a;). 
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